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Abstract. Some 25 years ago Valiant introduced an algebraic model of computation 
in order to study the complexity of evaluating families of polynomials. The theory was 
introduced along with the complexity classes VP and VNP which are analogues of the 
classical classes P and NP. Families of polynomials that are difficult to evaluate (that is, 
VNP-complete) includes the permanent and hamiltonian polynomials. 
In a previous paper the authors together with P. Koiran studied the expressive power 
of permanent and hamiltonian polynomials of matrices of bounded treewidth, as well as 
the expressive power of perfect matchings of planar graphs. It was established that the 
permanent and hamiltonian polynomials of matrices of bounded treewidth are equivalent 
to arithmetic formulas. Also, the sum of weights of perfect matchings of planar graphs 
was shown to be equivalent to (weakly) skew circuits. 

In this paper we continue the research in the direction described above, and study the ex- 
pressive power of permanents, hamiltonians and perfect matchings of matrices that have 
bounded pathwidth or bounded cliquewidth. In particular, we prove that permanents, 
hamiltonians and perfect matchings of matrices that have bounded pathwidth express 
exactly arithmetic formulas. This is an improvement of our previous result for matri- 
ces of bounded treewidth. Also, for matrices of bounded weighted cliquewidth we show 
membership in VP for these polynomials. 

1 Introduction 

In this paper we continue the work that was started in fS . Our focus is on easy special cases 
of otherwise difhcult to evaluate polynomials, and their relation to various classes of arithmetic 
circuits. It is conjectured that the permanent and hamiltonian polynomials are hard to evaluate. 
Indeed, in Valiant's model [16ll7j these families of polynomials are both VNP-complete. In the 
boolean framework they are complete for the complexity class jJP [18j . However, for matrices of 
bounded treewidth the permanent and hamiltonian polynomials can efficiently be evaluated - 
the number of arithmetic operations being polynomial in the size of the matrix [3] . 

An earlier result along these lines is related to computing weights of perfect matchings 
in a graph: The sum of weights of all perfect matchings in a weighted (undirected) graph is 
another hard to evaluate polynomial, but for planar graphs it can be evaluated efficiently due 
to Kasteleyn's theorem jTU] . 
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By means of reductions these evaluation methods can all be seen as general-purpose eval- 
uation algorithms for certain classes of polynomials. As an example, if an arithmetic formula 
represents a polynomial P then one can construct a matrix A of bounded treewidth such that: 

(i) The entries of A are variables of P, or constants from the underlying field. 

(ii) The permanent of A is equal to P. 

It turns out that the converse holds as well, so with respect to the computational com- 
plexity computing the permanent of a bounded treewidth matrix is equivalent to evaluating an 
arithmetic formula. In [8 the following results (with abuse of notation) were established: 

(i) permanent/hamiltonian(bounded treewidth matrix) = arithmetic formulas. 

(ii) perfect matchings (planar matrix) = arithmetic skew circuits. 

One can also by similar techniques show that: 

(iii) perfect matchings (bounded treewidth matrix) = arithmetic formulas. 

Other notions of graph "width" have been defined in the litterature besides treewidth, e.g. 
pathwidth, cliqucwidth and rankwidth. Here we would like to study the evaluation methods men- 
tioned above, but considering matrices A that have bounded pathwidth or bounded cliquewidth 
instead of bounded treewidth. In this paper we establish the following results: 

(i) per/ham/perf. match, (bounded pathwidth matrix) = arithmetic skew circuits of bounded 
width = arithmetic weakly skew circuits of bounded width = arithmetic formulas. 

(ii) arithmetic formulas C per/ham/perfect matchings (bounded cliquewidth matrix) C VP. 

Overview of the paper. The second section of the paper introduces definitions used through- 
out the paper and provides some small technical results related to graph widths. In particular we 
show equivalence between the weighted definitions of cliquewidth, NLC- width and m-cliquewidth 
with respect to boundedness. Sections 3 and 4 are devoted to the expressiveness of the perma- 
nent, hamiltonian, and perfect matchings of the graphs of bounded pathwidth and bounded 
weighted cliquewidth respectively. We prove in Section 3 that permanent, hamiltonian, and per- 
fect matchings limited to bounded pathwidth graphs express arithmetic formulas. In Section 4, 
we show that for all three polynomials the complexity is between arithmetic formulas and VP 
for graphs of bounded weighted cliquewidth. 

2 Definitions and preHminary results 
2.1 Arithmetic circuits 

Definition 1. An arithmetic circuit is a finite, acyclic, directed graph. Vertices have indegree 
or 2, where those with indegree are referred to as inputs. A single vertex must have outdegree 
0, and is referred to as output. Each vertex of indegree 2 must be labeled by either + or x, thus 
representing computation. Vertices are commonly referred to as gates and edges as arrows. 

By interpreting the input gates either as constants or variables it is easy to prove by induction 
that each arithmetic circuit naturally represents a polynomial. 

In this paper various subclasses of arithmetic circuits will be considered: For weakly skew 
circuits we have the restriction that for every multiplication gate, at least one of the incoming 
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arrows is from a subcircuit whose only connection to the rest of the circuit is through this 
incoming arrow. For skew circuits we have the restriction that for every multiphcation gate, at 
least one of the incoming arrows is from an input gate. For formulas all gates (except output) 
have outdegree 1. Thus, reuse of partial results is not allowed. 

For a detailed description of various subclasses of arithmetic circuits, along with examples, 
we refer to [Ti] . 

Definition 2. The size of a circuit is the total number 0/ gates in the circuit. The depth of a 
circuit is the length of the longest path from an input gate to the output gate. 

2.2 Pathwidth and treewidth 

Since the definition of pathwidth is closely related to the definition of treewidth (bounded 
pathwidth is a special case of bounded treewidth) we also include the definition of treewidth in 
this paper. Treewidth for undirected graphs is commonly defined as follows: 

Definition 3. Let G = {V, E) he a graph. A k-tree-decomposition of G is: 

(i) A tree T = {Vt,Et). 

(a) For each t GVt a subset Xt QV of size at most k + 1. 
(Hi) For each edge {u,v) € E there is at €Vt such that {u,v} C Xf. 
(iv) For each vertex v the set {t ^Vt\v ^ Xt\ forms a (connected) subtree of T . 

The treewidth of G is then the smallest k such that there exists a k-tree-decomposition for G. 
A k-Y^aXh.- decomposition of G is then a k-tree-decomposition where the "tree" T is a path (each 
vertex t d Vt has at most one child in T). 

Example 1. Here we show that cycles have pathwidth at most 2 by constructing a path-decom- 
position of G where each Xt has size at most 3. Let vi^V2, ■ . . ^Vn be the vertices of a graph 
G which is a cycle. The edges of G are (fi,W2), (1^2, 1's), • • • , {vn-i,Vn), {vn,vi). The vertex vi 
is contained in every Xt of the path-decomposition. Vertices V2 and W3 are contained in Xi, 
vertices and W4 are contained in X2, and so on. Finally, vertices Wn-i and w„ are contained 
in Xn-2- This gives a path-decomposition of G of width 2. 

The pathwidth (treewidth) of a directed, weighted graph is naturally defined as the pathwidth 
(treewidth) of the underlying, undirected, unweighted graph. The pathwidth (treewidth) of an 
(n X n) matrix M = {mi_j) is defined as the pathwidth (treewidth) of the directed graph 
Gm = (VmiEmjw) where Vm = {1, ■ • ■ ,?^}, (hj) G Em iff m-i.j 7^ 0, and w{i,j) = ruij. Notice 
that Gm can have loops. Loops affect neither the pathwidth nor the treewidth of Gm but are 
important for the characterization of the permanent polynomial. 

2.3 Cliquewidtii, NLCwidth and m-cliquewidth 

Although there exists many algorithmic results for graphs of bounded treewidth, there are 
still classes of "trivial" graphs that have unbounded treewidth. Cliques are an example of 
such graphs. Cliquewidth is a different notion of "width" for graphs, and it is more general 
than treewidth since graphs of bounded treewidth have bounded cliquewidth, but cliques have 
bounded cliquewidth and unbounded treewidth. 
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We recall the definitions of cliquewidth, NLCwidth and m-cliquewidth for unweighted, undi- 
rected graphs. Then we introduce the new notions of ly-cliquewidth, M^-NLCwidth and M^-m- 
cliquewidth which are variants of the preceding ones for weighted, directed graphs. These graph 
widths are all defined using terms over an universal algebra. When we refer to parse-trees it 
means the parse-trees of these terms. 

Definition 4 ( [3||5] ). A graph G has cliquewidth (denoted cwdiG)) at most k iff there exists a 
set of source labels S of cardinality k such that G can he constructed using a finite number of 
the following operations (named clique operations): 

(i) vera, a G S (basic construct: create a single vertex with label a), 
(ii) pa^b{H), a,b G S (rename all vertices with label a to have label b instead). 
(Hi) rja,b{H), a,b £ S, a =/= b (add edges between all couples of vertices where one of them has 

label a and the other has label b). 
(iv) H © H' (disjoint union of graphs). 

Example 2. Using the clique algebra, the clique with four vertices is constructed by the 
following term using only two source labels; S = {a, 6}: 

'na.b{{pa^b{Va,b{{Pa^b{ria,b{vera ® vevb))) © veTa))) © Vera). 

Definition 5 ([19j). A graph G has NLCwidth (denoted wnlc{G)) at most k iff there exists 
a set of source labels S of cardinality k such that G can be constructed using a finite number of 
the following operations (named NLC operations): 

(i) vera, a Cz S (basic construct: create a single vertex with label a). 

(ii) ojj.(iJ) for any mapping R from S to S (for every source label a G S rename all vertices 
with label a to have label R{a) instead). 

(Hi) H Xs H' for any S <^ (disjoint union of graphs to which are added edges between all 
couples of vertices x £ H (with label l^), y G H' (with label ly) having (l^, ly) £ S). 

One important distinction between cliquewidth and NLCwidth on one side and m-cliquewidth 
(to be defined below) on the other side is that in the first two each vertex is assigned exactly 
one label, and in the last one each vertex is assigned a set of labels (possibly empty). 

Definition 6 ([6]). A graph G has m-cliquewidth (denoted mcwd{G) ) at most k iff there exists 
a set of source labels S of cardinality k such that G can be constructed using a finite number of 
the following operations (named m-clique operations): 

(i) verA (basic construct: create a single vertex with a set of labels A, ACS). 

(ii) H ®s,h.h' H' for any S C and any h,h' : V{S) — > P{S) (disjoint union of graphs to 
which is added edges between all couples of vertices x £ H , y G H' whose sets of labels L^, Ly 
contain a couple of labels Ix, ly such that (l^, ly) G S . Then the labels of vertices from LI are 
changed via h and the labels of vertices from H' are changed via h' ). 

It is stated in [B] (a proof sketch of this result is given in ^6, , one of the inequalities is proven 
in 0) that 

mcwd{G) < wdNLciG) < cwd{G) < 2™='"'^(G')+i _ ^ 
Hence, cliquewidth, NLC-width and m-cliquewidth are equivalent with respect to boundedness. 
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We have seen that the definition of pathwidth and treewidth for weighted graphs straight 
forward was defined as the width of the underlying, unweighted graph. This is a major differ- 
ence compared to cliqucwidth. We can see that if we consider non-edges as edges of weight 0, 
then every weighted graph has a chque (which has bounded chquewidth 2) as its underlying, 
unweighted graph. 

Our main motivation for studying bounded cliquewidth matrices is to obtain efficient algo- 
rithms for evaluating polynomials like the permanent and hamiltonian for such matrices. For 
this reason, it is not reasonable to define the cliquewidth of a weighted graph as the cliquewidth 
of the underlying, unweighted graph, because then computing the permanent of a matrix of 
cliquewidth 2 is as difficult as the general case. Hence, we put restrictions on how weights are 
assigned to edges: Edges added in the same operation between vertices having the same pair of 
labels, will all have the same weight. 

We now introduce the definitions of PF-cliquewidth, M^-NLCwidth and W-m-cliquewidth. We 
will consider simple, weighted, directed graphs where the weights are in some set W. In the three 
following constructions, an arc from a vertex a; to a vertex y is only added by relevant operations 
if there is not already an arc from x to y. The operations that differ from the unweighted case 
are indicated by bold font. 

Definition 7. A graph G has W- cliquewidth (denoted Wcwd{G)) at most k iff there exists a 
set of source labels S of cardinality k such that G can be constructed using a finite number of 
the following operations (named W-clique operations): 

(i) vera, a G S (basic construct: create a single vertex with label a), 
(ii) pa^b{H), a,b G S (rename all vertices with label a to have label b instead). 
(Hi) oi^^{H), a,b e S, a b, w € W (add missing arcs of weight w from all vertices with label 

a to all vertices with label b). 
(iv) H ® H' (disjoint union of graphs). 

Definition 8. A graph G has W-NLCwidth (denoted WwdNLc{G)) at most k iff there exists 
a set of source labels S of cardinality k such that G can be constructed using a finite number of 
the following operations (named W-NLC operations): 

(i) vera, a G iS (basic construct: create a single vertex with label a). 

(ii) oji[H) for any mapping R from S to S (for every source label a G S rename all vertices 
with label a to have label R{a) instead). 

(Hi) H Xg H' for any partial function 5':5^x{ — 1,1}— (disjoint union of graphs to which 
are added arcs of weight w for each couple of vertices x € H , y G H' whose labels Ix, ly are 
such that S{lx, ly, s) = w; the arc is from x to y if s = 1 and from y to x if s = —1). 

Definition 9. A graph G has W-m- cliquewidth (denoted Wmcwd{G) ) at most k iff there exists 
a set of source labels S of cardinality k such thai G can be constructed using a finite number of 
the following operations (named W-m-clique operations): 

(i) verA (basic construct: create a single vertex with set of labels A, ACS), 
(ii) H 'E)s,h,h' H' for any partial function 5 : <S^ x {—1, 1} — >■ W and any h, h' : 'P{S) 

(disjoint union of graphs to which is added missing arcs of weight w for each couple of 
vertices x G H , y G H' whose sets of labels Lx,Ly contain lx,ly such that S{lx,ly,s) = w; 
the arc is from x to y if s = 1 and from y to x if s = —1. Then the labels of vertices from 
H are changed via h and the labels of vertices from H' are changed via h' ). 
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In the last operation for W^-m-cliquewidth, there is a possibility that two (or more) arcs are 
added from a vertex a; to a vertex y during the same operation and then the obtained graph is 
not simple. For this reason, we will consider as well-formed terms only the terms (or parse-trees) 
where this does not occur. 

The three preceding constructions of graphs can be extended to weighted graphs with loops 
by adding the basic constructs verloop^ or verloop^ which creates a single vertex with a loop 
of weight w and label a or set of labels A. If G is a weighted graph (directed or not) with loops 
and Unloop{G) denotes the weighted graph (directed or not) obtained from G by removing all 
loops, then one can easily show the following result. 

- Wcwd{G) = Wcwd{Unloop{G)). 

- WwdNLciG) = WwdNLc{Unloop{G)). 

- Wmcwd{G) — Wmcwd{Unloop{G)). 

This justifies the fact that we overlook technical details for loops in the proof of the following 
theorem. Theorem [1] shows that the inequalities between the three widths are still valid in the 
weighted case. It justifies our definitions of cliquewidth for weighted graphs. For the proof we 
collect the ideas in 6 9 and combine them with our definitions for weighted graphs. 

Theorem 1. For any weighted graph G, 

Wmcwd{G) < WwdNLciG) < Wcwd{G) < 2i^™^'"^(G)+i _ i. 

Proof. First inequality: 

Let G be a weighted graph of ly-NLCwidth at most k and T be a parse-tree constructing G 
with W-NLC operations on a set of source labels S of cardinality k. We can consider without 
loss of generality that in T: 

- there are no two consecutive o^(iJ) operations, otherwise we can replace T by T' where the 
two consecutive nodes of T with o^(iJ) and oj^,(^H) operations on them have been replaced 
by one node oj^„{H) {R" = R' oR). 

- no vera operation is followed by a Of([H) operation, otherwise we can replace T by T' where 
this two operations are replaced by ver^ where b — R{a). 

- each H Xs H' operation is followed by exactly one 0]^(H) operation, otherwise we can add 
an oij^(H) operation if there is none (Id is the identity function from S to S). 

We can replace the H^-NLC operation vera by the VF-m-clique operation wer^a}, and the con- 
secutives VF-NLC operation H Xs H' and ob,{H) by the W^-m-clique operation H ®s,h,h H' 
where h{{a}) — {i?(a)},Va e S. It is clear that these replacements in T will give a parse-tree 
constructing G with T^-ni-clique operations on the same set of source labels S of cardinality k. 
Hence, we have Wmcwd{G) < WwdNLciG)- 
Second inequality: 

Let G be a weighted graph of VF-cliquewidth at most k and T be a parse-tree constructing G 
with VF-clique operations on a set of source labels S of cardinality k. We can consider without 
loss of generality that in T: 

- after a disjoint union operation H (B H' all arcs in G from x € H to y G H' (resp. from 
y to x) are added between the disjoint union operation H (B H' and the first following 
operation O of disjoint union or renaming. Otherwise consider the first operation a™j(i?) 
after O adding an arc between a vertex x' from H and a vertex y' from H'. We can add 
an operation a™, f,/(i?) before O where a' (resp. b') is the label in H (B H' of the tail (resp. 
head) of the arc added by the operation a'^^{H). 
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- each operation ^{H) add at least one arc. 

- all ct^i,{H) operations are between a disjoint union operation H (B H' and the first following 
operation O of disjoint union or renaming. 

We can replace the VF-clique operation vera by the T^-NLC operation vera, and the M^-clique 
operation pa-,b{H) by the VF-NLC operation o^(iJ) where R{a) = b and R{c) = c,Wc ^ S,c^ a. 
Finally each group consisting of a © H' W^-clique operation and the following a^f^{H) 
VF-clique operations can be replaced by the W-NLC operation G Xs G' where S{a,b,l) — 
S{a, b, —1) = w if there is an d^^^lH) operation in the group. It is clear that these replacements 
in T will give a parse-tree constructing G with VF-NLC operations on the same set of source 
labels iS of cardinality k. Hence, we have WwdNLc{G) < Wcwd{G). 
Last inequality: 

Let G be a weighted graph of ly-m-cliquewidth at most k and T be a parse-tree constructing 
G with ly-m-clique operations on a set of source labels S of cardinality k. Let S' be a set of source 
labels of cardinality 2*^+^ — 1,5' = SiUSrU{empty} where = \Sr\ = 2^ — 1. We define three 
bijcctions I : V[S)\% Si, r : r{S)\% Sr, and u : Si ^ Sr such that u{l{A)) = r(A),VA G 
Vis). We will denote by p/ a sequence of pa^b W^-clique operations realizing a function / from 
S' to S'. We associate to each function S : S"^ x { — 1,1} — > a sequence as consisting of 
'^T{A) r{B) (resp. ajf^g) j^^^) W^-clique operations for all couples (a, 6) e 5^, {A,B) e (■p(5)\0)^ 
such that S{a, b,l) = w (resp. S{a, b, —1) = w), a G A and b E B. 

We can replace the VF-m-clique operation verA by the VF-clique operation i'er/(^) if yl 
and ver empty otherwise. Each IV-m-clique operation H'Sis,h,h' H' will be replaced by the following 
W-clique operations: 

- apply p„ to the subtree constructing H' . 

- make a. H ® H' ly-clique operation. 

- apply as- 

- apply piohoi-^- 

- apply pioh'or-^- 

It is clear that these replacements in T will give a parse-tree constructing G with ly-clique 
operations on the set of source labels S' of cardinality 2*^+^ — 1. Hence, we have Wcwd{G) < 

2W7nc'wd{G) + l ^ [— ] 

2.4 Permanent and hamiltonian polynomials 

In this paper we take a graph theoretic approach to deal with permanent and hamiltonian poly- 
nomials. The reason for this is that a natural way to define pathwidth, treewidth or cliquewidth 
of a matrix M is by the width of the graph Gm (see Section . also see e.g. [T2]. 

Definition 10. A cycle cover of a directed graph is a subset of the edges, such that these edges 
form disjoint, directed cycles (loops are allowed). Furthermore, each vertex in the graph must be 
in one ( and only one ) of these cycles. The weight of a cycle cover is the product of weights of 
all participating edges. 

Definition 11. The permanent of an (n x n) matrix M ~ {mi_j) is the sum of weights of all 
cycle covers ofGM- 
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The permanent of M can also be defined by the formula 

n 

per(M) = 

The equivalence with Definition [11] is clear since any permutation can be written down as 
a product of disjoint cycles, and this decomposition is unique. The hamiltonian polynomial 
ham(M) is defined similarly, except that we only sum over cycle covers consisting of a single 
cycle (hence the name). 

There is a natural way of representing polynomials by permanents. Indeed, if the entries 
of M are variables or constants from some field K, then / = per(Af) is a polynomial with 
coefficients in K (in Valiant's terminology, / is a projection of the permanent polynomial). In 
the next sections we study the power of this representation in the case where M has bounded 
pathwidth or bounded cliquewidth. 

2.5 Connections betvifeen permanents and sum of weights of perfect matchings 

Another combinatorial characterization of the permanent is by sum of weights of perfect match- 
ings in a bipartite graph. We will use this connection to deduce results for the permanent from 
results for the sum of weights of perfect matchings and vice versa. 

Definition 12. Let G be a directed graph (weighted or not). We define the inside-outside graph 
ofG, denoted IO{G), as the bipartite, undirected graph (weighted or not) obtained as follows: 

— split each vertex u e V{G) in two vertices and ; 

— each arc uv (of weight w) is replaced by an edge between and v~ (of weight w). A loop 
on u (of weight w) is replaced by an edge between and u~ (of weight w). 

It is well-known that the permanent of a matrix M can be defined as the sum of weights of all 
perfect matchings of IO{Gm). We can see that the adjacency matrix of IO{Gm) is 

Lemma 1. // G has treewidth (pathwidth) k, then 10(G) has treewidth (pathiuidth) at most 
2-k+l. 

Proof. Let (T, {Xt)tev{T)) be a fc-tree(path)-decomposition of G. It is clear that (T, {X^)tev{T)), 
where = {u^,u^\u G Xt}, is a tree(path)-decomposition of IO{G) of width 2 ■ fc -f 1. □ 

Lemma 2. If G has W -cliquewidth k, then IO{G) has W -cliquewidth at most 2 ■ k. 

Proof. Let T be a parse-tree constructing G with ly-clique operations on a set of source la- 
bels S of cardinality k. We can replace the I^-clique operation vera by the three operations 
(wer(j+) [ver^-), and the VF-clique operation pa^b{H) by the M^-clique operations Pa+->b+{H) 
and Pa-^b- (H). Finally each af^^[H) ly-clique operation can be replaced by the rf^+ j_ [H) W- 
clique operation. It is clear that these replacements in T will give a parse-tree constructing 
IO{G) with Vl^-clique operations on the set of source labels {a+, a^\a € S} of size 2 • fc. □ 
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3 Expressiveness of matrices of bounded pathwidth 



In this section we study the expressive power of permanents, hamiltonians and perfect matchings 
of matrices of bounded pathwidth. We wiU prove that in each case we capture exactly the famihes 
of polynomials computed by polynomial size skew circuits of bounded width. A by-product of 
these proofs will be a proof of the equivalence between polynomial size skew circuits of bounded 
width and polynomial size weakly skew circuits of bounded width. This equivalence can not be 
immediately deduced from the already known equivalence between polynomial size skew circuits 
and polynomial size weakly skew circuits in the unbounded width case [15^ (the proofs in [15j 
use a combinatorial characterization of the complexity of the determinant as the sum of weights 
of s, i-paths in a graph of polynomial size with distinguished vertices s and t. The additional 
difficulties to extend these proofs to circuits and graphs of bounded width would be equivalent 
to the ones we deal with). We will then prove that skew circuits of bounded width are equivalent 
to arithmetic formulas. 

Definition 13. An arithmetic circuit tp has bounded width k > 1 if there exists a finite set of 
totally ordered layers such that: 

- Each gate of (p is contained in exactly 1 layer. 

- Each layer contains at most k gates. 

- For every non-input gate of (p if that gate is in some layer n, then both inputs to it are in 
layer n + 1. 

Theorem 2. The polynomial computed by a weakly skew circuit of bounded width can be ex- 
pressed as the permanent of a matrix of bounded pathwidth. The size of the matrix is polynomial 
in the size of the circuit. All entries in the matrix are either 0, 1, constants of the polynomial, 
or variables of the polynomial. 

Proof. Let (/? be a weakly skew circuit of bounded width fc > 1 and / > 1 the number of layers 
in if. The directed graph G we construct will have pathwidth at most \J-j-\ — 1 (each bag in the 
path-decomposition will contain at most [-^J vertices) and the number of bags in the path- 
decomposition will be / — 1. G will have two distinguished vertices s and t, and the sum of 
weights of all directed paths from s to i equals the value computed by tp. The vertex s will be 
in all bags of the path-decomposition of G. 

Since (/? is a weakly skew circuit we consider a decomposition of it into disjoint subcircuits 
defined recursively as follows: The output gate of ip belongs to the main subcircuit. If a gate in 
the main subcircuit is an addition gate, then both of its input gates are in the main subcircuit 
as well. If a gate g in the main subcircuit is a multiplication gate, then we know that at 
least one input to g is the output gate of a subcircuit which is disjoint from (p except for 
its connection to g. This subcircuit forms a disjoint multiplication-input subcircuit. The other 
input to g belongs to the main subcircuit. If some disjoint multiplication-input subcircuit ip' 
contains at least one multiplication gate, then we make a decomposition of tp' recursively. Note 
that such a decomposition of a weakly skew circuit not necessarily is unique (nor does it need 
to be), because both inputs to a multiplication gate can be disjoint from the rest of the circuit, 
and then any one of these two can be chosen as the one that belongs to the main subcircuit. 

Let ipo,ipi, . . . ,ipd be the disjoint subcircuits obtained in the decomposition [tpQ is the main 
subcircuit) . The graph G will have a vertex Vg for every gate g oi ip and d-\-l additional vertices 
s = So, si, . . . , Sd (t will correspond to Vg where g is the output gate of ip). For every gate g 
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in the subcircuit Lpi, the foUowing construction wiU ensure that the sum of weights of directed 
paths from Si to Vg is equal to the value computed at g in ip. 

For the construction of G we process the decomposition of (/3 in a bottom-up manner. Let 
subcircuit Lpi be a leaf in the decomposition of (so Lpi consists solely of addition gates and 
input gates). Assume that ipi is located in layers topi through boti (1 > topi > boU > I) of ip. 
First we add a vertex Si to G in bag boti — 1, and for each input gate with value w in the bottom 
layer boti of (pi we add a vertex to G also in bag boti ~ 1 along with an edge of weight w from s,; 
to that vertex. Let n range from boti — 1 to topi : Add the already created vertex Si to bag n — 1 
and handle input gates of ipi in layer n as previously described. For each addition gate of ipi in 
layer n we add a new vertex to G (which is added to bags n and n — 1 of the path-decomposition 
of G). In bag n we already have two vertices that represent inputs to this addition gate, so we 
add edges of weight 1 from both of these to the newly added vertex. The vertex representing 
the output gate of the circuit ipi is denoted by ti . The sum of weighted directed paths from Si 
to ti equals the value computed by the subcircuit ipi. 

Let ipi be a subcircuit in the decomposition of p that contains multiplication gates. Addition 
gates and input gates in ipi are handled as before. Let 5 be a multiplication gate in ipi in layer 
n and ipj the disjoint multiplication-input subcircuit that is one of the inputs to g. We know 
that vertices Sj and tj already are in bag n, so we add an edge of weight 1 from the vertex 
representing the other input to g to the vertex Sj, and an edge of weight 1 from tj to a newly 
created vertex Vg that represents gate g, and then Vg is added to bags n and n — 1. 

For every b (1 > b > I — I) we need to show that only a constant number of vertices are 
added to bag b during the entire process. Every gate in layer 6 of iy9 is represented by a vertex, 
and these vertices may all be added to bag b. Every gates in layer 6-1-1 are also represented by 
a vertex, and all of these are added to bag b (because they are used as input here). So far we 
have at most 2 • k gate vertices in each bag. In addition a number of Si vertices are also added 
to bag b. For each subcircuit ipj that has a gate in layer 6 or 6 -I- 1, we have the corresponding Sj 
vertex in bag 6, so what remains is to show that at most [^J disjoint subcircuits have a gate 
in layer b 01 b + 1. Each of these subcircuits are in exactly one of the following 3 sets: 

Ci: Subcircuits that have a gate in layer b, but NONE of them are multiplication gates. 
C2: Subcircuits that DO have a multiplication gate in layer b. 
C3: Subcircuits that have their root in layer b + 1. 

There are at most [|J subcircuits in the set C2. Otherwise, since two inputs to a multiplication 
gate are in different subcircuits and since subcircuits in C2 are disjoint layer 6-1-1 would contain 
at least 2 • ( [|J -f- 1) gates and thus have width more than k. By how subcircuits are constructed, 
all subcircuits in C3 are considered as the disjoint multiplication-input subcircuit of distinct 
multiplication gates in layer b, so there are at least jCaj multiplication gates in layer b. Since 
subcircuits in Ci do NOT have multiphcation gates in layer b we have that |Ci| -I- IC3I < k. 
Thus, at most \Ci \ + IC2I -I- IC3I < [^J distinct subcircuits have their Si vertex added to bag b. 

Note that in layer 1 of we just have the output gate. This gate is represented by the vertex 
i of G which is in bag 1 of the path-decomposition. 

The sum of weights of all directed paths from s to t in G can by induction be shown to be 
equal to the value computed by ip. The final step in the reduction to the permanent polynomial 
is to add an edge of weight 1 from t back to s and loops of weight 1 at all nodes different from 
s and t. □ 

The proof of Theorem [2] can be modified to work for the hamiltonian polynomial as well. 
We adapt the idea used to show universality of the hamiltonian polynomial in [T3]. For the 
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permanent polynomial each bag in the path-decomposition contains at most [-^J vertices; for 
each of these vertices we now need to introduce one extra vertex in the same bag. In addition 
each bag must contain 2 more vertices in order to establish a connection to adjacent bags in the 
path-decomposition. In total each bag now contains at most 7 ■ k + 2 vertices. 

Theorem 3. The polynomial computed by a weakly skew circuit of bounded width can be ex- 
pressed as the sum of weights of perfect matchings of a symmetric matrix of bounded pathwidth. 
The size of the matrix is polynomial in the size of the circuit. All entries in the matrix are either 
0, 1, constants of the polynomial, or variables of the polynomial. 

Proof. It is a direct consequence of Theorem [2] and Lemma [TJ □ 

Now we prove that the permanent, the hamiltonian, and the sum of weights of perfect 
matchings of a bounded pathwidth graph can be expressed as a skew circuit of bounded width. 

Theorem 4. The hamiltonian of a matrix of bounded pathwidth can be expressed as a skew 
circuit of bounded width. The size of the circuit is polynomial in the size of the matrix. 

Proof. Let M be a matrix of bounded pathwidth k and let Gm be the underlying, directed 
graph. Each bag in the path-decomposition of Gm contains at most fc -I- 1 vertices. We refer to 
one end of the path-decomposition as the leaf of the path-decomposition and the other as the 
root (recall that path-decompositions are special cases of tree-decompositions). 

We process the path-decomposition of Gm from the leaf towards the root. The overall idea 
is the same as the proof of Theorem 5 in [8] - namely to consider weighted partial path covers 
(i.e. partial covers consisting solely of paths) of subgraphs of Gm that are induced by the 
path-decomposition of Gm- During the processing of the path-decomposition of Gm at every 
level distinct from the root, new partial path covers are constructed by taking one previously 
generated partial path cover and then add at most (fc -I- 1)^ new edges, so all the multiplication 
gates we have in our circuit are skew. For any bag in the path-decomposition of Gm we only 
need to consider a number of partial path covers that depends solely on k, so the circuit we 
produce has bounded width. At the root we add sets of edges to partial path covers to form 
hamiltonian cycles. □ 

Theorem 5. The sum of weights of perfect matchings of a symmetric matrix of bounded path- 
width can be expressed as a skew circuit of bounded width. The size of the circuit is polynomial 
in the size of the matrix. 

Proof. Let M be a symmetric matrix of bounded pathwidth fc and let Gm be the underlying, 
undirected graph. Each bag in the path-decomposition of Gm contains at most fc + 1 vertices. 

We process the path-decomposition of Gm from the leaf towards the root. The proof is very 
similar to the proof of Theorem [4]- namely to consider weighted matchings of subgraphs of Gm 
that are induced by the matching of Gm- During the processing of the matching of Gm at every 
level distinct from the root, new matchings are constructed by taking one previously generated 
matching and then add at most (fc -f- 1)^ new edges, so all the multiplication gates we have in 
our circuit are skew. For any bag in the path-decomposition of Gm we only need to consider a 
number of matchings that depends solely on fc, so the circuit we produce has bounded width. At 
the root we sum only the weights of perfect matchings to obtain the output of the circuit. □ 

Theorem 6. The permanent of a matrix of bounded pathwidth can be expressed as a skew circuit 
of bounded width. The size of the circuit is polynomial in the size of the matrix. 
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Proof. It is a direct consequence of Theorem [S] and Lemma [TJ □ 

Corollary 1. A family of polynomials is computable by polynomial size skew circuits of bounded 
width if and only if it is computable by polynomial size weakly skew circuits of bounded width. 

Proof. It is trivial to see that a family of polynomials computed by polynomial size skew circuits 
of bounded width can be computed by polynomial size weakly skew circuits of bounded width. 
Conversely, if a family of polynomials is computed by polynomial size weakly skew circuits of 
bounded width then by Theorem [2] it can be expressed as the permanents of boimded pathwidth 
graphs which can be computed by polynomial size skew circuits of bounded width according to 
Theorem El □ 

We need the following Theorem from [T] to prove the equivalence between polynomial size 
skew circuits of bounded width and polynomial size arithmetic formulas. 

Theorem 7. Any arithmetic formula can be computed by a linear bisection straight-line program 
of polynomial size that uses three registers. 

Let . . . , Rm be a set of m registers, a linear bijection straight-line (LBS) program is a 
vector of m initial values given to the registers plus a sequence of instructions of the form 

(i) R-j ^ Rj + X c), or 

(ii) Rj ^ Rj - (Ri X c), or 

(iii) Rj ^ Rj + {Ri X Xy,),or 

(iv) Rj ^ Rj - {Ri X Xy,), 

where ^ < i, j < rn, i ^ j , 1 < w < n, c is a constant, and xi, . . . ,Xn are variables {n is the 
number of variables). We suppose without loss of generality that the value computed by the 
LBS program is the value in the first register after all instructions have been executed. 

Theorem 8. A family of polynomials is computable by polynomial size skew circuits of bounded 
width if and only if it is computable by a family of polynomial .size arithmetic formulas. 

Proof. Let (/„) be a family of polynomials computable by polynomial size skew circuits of 
bounded width, then by Theorem[5]it can be expressed as the permanents of bounded pathwidth 
graphs. Since graphs of bounded pathwith have bounded treewidth, we know by Theorem 5 in [5] 
that it can be computed by a family of polynomial size arithmetic formulas. 

Conversely, if (/„) is a family of polynomial size arithmetic formulas, then by Theorem[7l it is 
computable by linear bijection straight-line programs of polynomial size that use three registers. 
We will modify these programs to obtain equivalent skew circuits of width 6. At each step, the 
set of indices {i, j, k} will be equal to {1, 2, 3}. 

Suppose the initial values of the three registers are ri , r2 , ra , then the first layer of our skew 
circuit contains three input gates with the three values ?'i,r2,r3 along with two others inputs 
which will be defined according to the next instruction in the straight-line program. 

If the next instruction is Rj <— Rj + {Ri x U) where J7 is a variable or a constant, then 
we assign the values and U to the two input gates not already defined in the current layer 
I and we create a new layer I — 1 with three addition gates corresponding to Ri,Rj,Rk whose 
inputs are the gate corresponding to Ri (resp. Rj,Rk) in layer / and the input with value in 
layer I. We also put a multiplication gate whose inputs are the gate corresponding to Ri and the 
input with value U in layer I. And we put again an input gate with value 0. Then we create a 
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new layer I — 2 with three addition gates corresponding to Ri , Rj , Rk whose inputs are the gate 
corresponding to Ri (resp. Rj,Rk) and the input with value for i,k or the gate computing 
{Ri X U) for j in layer I — 1. We also put two others inputs which will be defined according to 
the next instruction. 

If the next instruction is Rj <— Rj — (Ri x U), then we need to create one more layer than 
in the first case. We first assign the values and U to the two input gates not already defined 
in the current layer I and we create a new layer I — 1 with three addition gates corresponding 
to Ri,Rj,Rk whose inputs are the gate corresponding to Ri (resp. Rj.Rj,) in layer I and the 
input with value in layer I. We also put a multiplication gate whoso inputs are the gate 
corresponding to Ri and the input with value U in layer I. And we put again an input gate with 
value and another one with value —1. Then we create an intermediate new layer 1 — 2 with 
three addition gates corresponding to Ri, Rj,Rh whose inputs are the gate corresponding to Ri 
(resp. Rj,Rk) and the input with value 0. We also put a multiplication gate whose inputs are the 
gate computing (Ri x U) and the input with value —1 in layer I — 1. And we put again an input 
gate with value 0. Finally we create a new layer I — 3 with three addition gates corresponding 
to Ri, Rj,Rk whose inputs are the gate corresponding to Ri (resp. Rj,Rk) and the input with 
value for i, k or the gate computing —{Ri x U) for j in layer I — 2. We also put two others 
inputs which will be defined according to the next instruction. 

In both cases, it is clear by induction that the three gates of the current layer corresponding 
to Ri, Rj,Rk are computing the values in these registers if we execute the instructions treated 
so far. Hence the result. □ 



4 Expressiveness of matrices of bounded weighted cliquewidth 

In this section wc study the expressive power of pcrmanents, hamiltonians and perfect matchings 
of matrices that have bounded weighted cliquewidth. 

We first prove that every arithmetic formula can be expressed as the permanent, hamiltonian, 
or sum of weights of perfect matchings of a matrix of bounded VF-cliquewidth, using the results 
for the bounded pathwidth matrices and the following lemma. 

Lemma 3. Let G be a weighted graph (directed or not) with weights inW. If G has pathwidth 
k, then G has W -cliquewidth at most k + 2. 

Proof. Let (T, {Xt)fQV{T)) be a fc-path-decomposition of G. We refer to one end of the path- 
decomposition as the leaf of the path-decomposition and the other as the root. Let Gt be the 
subgraph of G induced by the vertices in bags below Xj. 

We prove by induction on the height of (T, (-'^t)tev'(T)) that every graph Gt can be con- 
structed by VF-clique operations using at most k + 2 distinct labels. Moreover, at the end of this 
construction all vertices in bag Xt have distinct labels and all other vertices have a sink label. 

If |V^(T)| = 1 then G has at most A: -|- 1 vertices. We can create them with A: -|- 1 distinct 
labels and add independently each edge between two vertices using W-clique operations. 

Suppose |V(T)| > 1, let r be the root and t be its child. By induction, Gt can be constructed 
by VF-clique operations using at most k + 2 distinct labels. For all vertex v G Xt\Xr, we add a 
renaming operation which gives sink label to v (this renaming operation renames only v since, 
by induction, v has distinct label from other vertices). Since \Xr\ < k + 1 and all vertices in 
V{G)\Xr have sink label, we can create the vertices of Xr\Xt with distinct labels and add 
them by disjoint union to the current construction. It is now clear that all the vertices of Xr 
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have distinct labels thus we can add independently each edge between two vertices. Hence the 
conclusion. □ 



Theorem 9. Every arithmetic formula can be expressed as the permanent of a matrix of W- 
cliquewidth at most 22 and size polynomial in n, where n is the size of the formula. All entries 
in the matrix are either 0, 1, constants of the formula, or variables of the formula. 

Proof. Let </? be a formula of size n. Due to the proof of Theorem [SJ we know that it can be 
computed by a skew circuit of width 6 and size 0(n'-'(^^). Hence it is equal to the permanent of 
a graph of size 0{nP'^^^), pathwidth at most [^J — 1 = 20 by Theorem [21 and M^-cliquewidth 
at most 20 + 2 = 22 by Lemma |3l □ 

For the hamiltonian the W-cliquewidth becomes ((7 • 6 + 2) — 1) + 2 — 45 instead. 

Theorem 10. Every arithmetic formula can be expressed as the sum of weights of perfect match- 
ings of a symmetric matrix of W -cliquewidth at most 44 and size polynomial in n, where n is 
the size of the formula. All entries in the matrix are either 0, 1, constants of the formula, or 
variables of the formula. 

Proof. It is a direct consequence of Theorem [HI and Lemma [H □ 

Alternatively we can modify the constructions of bounded treewidth graphs expressing formu- 
las in [8] . These modifications require more work than the preceding proofs but we obtain smaller 
constants since we obtain graphs of VF-cliquewidth at most 13/34/26 (instead of 22/45/44) whose 
permanent/hamiltonian/sum of weights of perfect matchings are equal to formulas. The proofs 
of these constants are given in the Appendix. 

Due to our restrictions on how weights are assigned in our definition of bounded VF-clique- 
width it is not true that weighted graphs of bounded treewidth have bounded PF-cliquewidth. 
In fact, if one tries to follow the proofs in [512] that show that graphs of bounded treewidth 
have bounded cliquewidth, then one obtains that a weighted graph G of treewidth k has W- 
cliquewidth at most 3 • {\Wg\ + or 3 • (Z\ + 1)*-'"^. Wq denotes the set of weights on the 

edges of G and A is the maximum degree of G. Weighted trees still have bounded weighted 
cliquewidth (the bound is 3), but we can show that there exists a family of weighted graphs 
with treewidth 2 and imbounded M^-cliquewidth [TT]. 

We now turn to the upper bound on the complexity of the permanent, hamiltonian, and sum 
of weights of perfect matchings of graphs of bounded weighted cliquewidth. We show that in all 
three cases the complexity is at most the complexity of VP. 

The decision version of the hamiltonian cycle problem has been shown to be polynomial 
time solvable in f7j for matrices of bounded cliquewidth. Here we extend these ideas in order to 
compute the hamiltonian polynomial efficiently (in VP) for bounded ly-m-cliquewidth matrices. 

Definition 14. A path cover of a directed graph G is a subset of the edges of G, such that 
these edges form disjoint, directed, non-cyclic paths in G. We require that every vertex of G is 
in (exactly) one path. For technical reasons we allow "paths" of length 0, by having paths that 
start and end in the same vertex. Such constructions do not have the same interpretation as a 
loop. The weight of a path cover is the product of weights of all participating edges ( in the special 
case where there are no participating edges the weight is defined to be 1). 

Theorem 11. The hamiltonian ofannxn matrix of bounded W -m- cliquewidth can be expressed 
as a circuit of size 0(n°(i)) and thus is in VP. 
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Proof. Let M be an n x n matrix of bounded W-m-cliquewidth. By G we denote the underlying, 
directed, wciglitcd graph for M. The circuit is constructed based on the parsc-trcc T for G. By 
Tt we denote the subtree of T rooted at t for some node t By Gt we denote the subgraph 
of G constructed from the parse-tree Tt. 

The overall idea is to produce a circuit that computes the sum of weights of all hamiltonian 
cycles of G. To obtain this there will be non-output gates that compute weights of all path 
covers of all Gt graphs, and then wc combine these subresults. Of course, the total number of 
path covers can grow exponentially with the size of Gt, so wc will not "describe" path covers 
directly by the edges participating in the covers. Instead we describe a path cover of some Gt 
graph by the labels associated with the start- and end-vertices of the paths in the cover. Such 
a description do not uniquely describe a path cover, because two different path covers of the 
same graph can contain the same number of paths and all these paths can have the same labels 
associated. However, we do not need the weight of each individual path cover. If multiple path 
covers of some graph Gt share the same description, then we just compute the sum of weights 
of these path covers. 

For a leaf in the parse-tree T of G we construct a single gate of constant weight 1, representing 
a path cover consisting of a single "path" of length 0, starting and ending in a vertex with the 
given labels. Per definition this path cover has weight 1. 

For an internal node t Cz T the grammar rule describes which edges to add and how to relabel 
vertices. We obtain new path covers by considering a path cover from the left child of t and a 
path cover from the right child of t: For each such pair of path covers consider all subsets of 
edges added at node t, and for every subset of edges check if the addition of these edges to the 
pair of path covers will result in a valid path cover. If it does, then add a gate that computes 
the weight of this path cover, by multiplying the weight of the left path cover, the weight of the 
right path cover and the total weight of the newly added edges. After all pairs of path covers 
have been processed, check if any of the resulting path covers have the same description - namely 
that the number of paths in some path covers are the same, and that these paths have the same 
labels for start- and end-vertices. If multiple path covers have the same description then add 
addition gates to the circuit and produce a single gate which computes the sum of weights of all 
these path covers. 

For the root node r of T we combine path covers from the children of r to produce hamiltonian 
cycles, instead of path covers. Finally, the output of the circuit is a summation of all gates 
computing weights of hamiltonian cycles. 

Proof of correctness: The first step of the proof is by induction over the height of the parse- 
tree T. We will show that for each non-root node t ofT there is for every path cover description 
of Gt a corresponding gate in the circuit that computes the sum of weights of all path covers of 
Gt with that description. For the base cases - leaves of T - it is trivially true. 

For the inductive step we consider two disjoint graphs that are being connected with edges 
at a node t of the parse-tree T. Edges added at node t arc only added in here, and not at any 
other nodes in T, so every path cover of Gt can be split into 3 parts: A path cover of Gj, , a 
path cover of Gt^ and a polynomial number of edges added at node t. Consider a path cover 
description along with all path covers of Gt that have this description. All of these path covers 
can be split into 3 such parts, and by our induction hypothesis the weights of the path covers 
of Gti and Gt^ are computed in already constructed gates. 

In order to complete the proof of correctness we have to handle the root t of T in a special 
way. At the root we do not compute weights of path covers, but instead compute weights of 
hamiltonian cycles. Every hamiltonian cycle of G can (similarly to path covers) be split into 3 
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parts: A path cover of Gt, , a path cover of Gt,, and a polynomial number of edges added at the 
root of T . By our induction hypothesis all the needed weights are already computed. 

The size of the circuit is polynomial since at each step the number of path cover descriptions 
is polynomially bounded once the M^-m-cliquewidth is bounded. □ 

Theorem 12. The sum of weights oj perfect matchings ofannxn symmetric matrix of bounded 
W -NLCwidth can be expressed as a circuit of size 0(n°(i)) and thus is in VP. 

Proof. Let M be an n x n symmetric matrix of bounded VF-NLCwidth. By G we denote the 
underlying, undirected, weighted graph for M. The circuit is constructed based on the parse-tree 
T for G. By we denote the subtree of T rooted at t for some node t G T. By Gt we denote the 
subgraph of G constructed from the parse-tree Tf. Let k be the ly-NLCwidth of G. We assume 
without loss of generality that T is a parse-tree on the set of labels {ai, . . . , Ofe}. 

The overall idea is much similar to that of Theorem 1111 namely to produce a circuit that 
computes the sum of weights of all perfect matchings of G. To obtain this there will be non- 
output gates that compute weights of all matchings of all Gt graphs, and then we combine these 
subresults. Of course, the total number of matchings can grow exponentially with the size of Gt, 
so we will not "describe" matchings directly by the edges participating in the covers. Instead we 
describe a matching of some Gt graph by the labels associated to the uncovered vertices. More 
precisely, for each matching of Gt and each label a we give the number of a- vertices which are not 
covered by the matching. Such a description do not uniquely describe a matching, because two 
different matchings of the same graph can have the same number of uncovered vertices which 
have the same labels associated. However, we do not need the weight of each individual matching. 
If multiple matchings of some graph Gt share the same description, then we just compute the 
sum of weights of these matchings. It is clear that the number of description needed is at most 

For a leaf vera- in the parse-tree T of G we construct a single terminal gate of con- 
stant weight 1, representing an empty matching. The description associated to this gate is 
((ai,0),...,(ai,l),...,(afe,0)). 

For an internal node i G T with operation oj^(_ff) we just need to change the description of 
terminal gates in the circuit contructed so far. More precisely, if the description of the gate was 
((ai,7ii), . . . , (aj,7ii), . . . , (afc,nfc)) then it becomes 

((ai, nj),...,{a„ ^ nj),...,{ak, ^ nj)). 

ajG_R-i(ai) ajeR-^{ai) ajeR-'-{ak) 

For an internal node i G T with operation H Xs H' the grammar rule describes which edges 
to add. We first create a multiplication gate using the values of each couple of terminal gates 
of the left child I of t and the right child r of t. It corresponds to the weights of the disjoint 
unions of the matchings of I and r. There is at most n^*^ such gates. To each gate, we associate 
a left and right description corresponding to the vertices from I and r. Those gates are the new 
terminal gates. We put the following total order ai < a2 < • • • < Ofe on the labels and the 
corresponding lexicographic order on the couples {at, aj). We will consider that the edges added 
via S are added by blocks corresponding to a couple {ai,aj) (All edges in the same block are 
added at the same time) and that all blocks of edges are added sequentially in lexicographic 
order. Thus we have at most fc^ steps of adding edges to consider. Suppose S{ai,aj) = Wij. 
For the step corresponding to {ai,aj) we obtain new matchings by considering each terminal 
gate go- Let ((ai, ni), . . . , (ai, n^), . . . , (a^, n^)) and ((ai, n'J, . . . , (a^ , n^), . . . , (ofe, n'j.)) be the 
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left and right description of go. Let n„iin — min{ni, n'j}. For all matching corresponding to go 



p), . . . , (ofc, nj.)). There are at most 2 • such new gates since p < n. Finally we make 

an addition tree computing the addition of the gates gp which have the same left and right 
description. Each such tree needs at most 0{{2k + 2) log(n)) new gates and there are at most 
2 • n^*^ trees. The outputs of these trees are the new terminal gates. When all the fc^ steps of 
adding edges are done we compute the description of each terminal gate as the sum of its left 
and right description then we put an addition tree computing the addition of the terminal gates 
which have the same global description. The outputs of these trees are the new terminal gates. 

Finally, we obtain the output of the circuit at the root node r of T. It is the output of the 
terminal gate with description ((ai, 0), . . . , (a^, 0), . . . , (a^, 0)). 

Proof of correctness: The first step of the proof is by induction over the height of the parse- 
tree T. We will show that for each node t of T there is for every matching description of Gt a 
corresponding gate in the circuit that computes the sum of weights of all matchings of Gt with 
that description. For the base cases - leaves of T - it is trivially true. 

For the inductive step we consider two disjoint graphs that are being connected with edges at 
a node t of the parse-tree T. Edges added at node t are only added in here, and not at any other 
nodes in T, so every matching of Gt can be split into 3 parts: A matching of Gt, , a matching of 
Gt^ and a polynomial number of edges added at node t. Consider a matching description along 
with all matchings of Gt that have this description. All of these matchings can be split into 3 
such parts, and by our induction hypothesis the weights of the path covers of Gj, and Gj^ are 
computed in already constructed gates. 

The number of new gates added for each operation H X5 i?' is at most 0{k^ ■ 71^*^+^). Since 
the number of these operations is at most n, we obtain a circuit of polynomial size. □ 

Theorem 13. The permanent of an nx n matrix of bounded W -m-cliquewidth can he expressed 
as a circuit of size 0(n°(i)) and thus is in VP. 

Proof. It is a direct consequence of Theorem [12] and Lemma [2j □ 
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6 Appendix 

Theorem 14. Every arithmetic formula can be expressed as the permanent of a matrix of W- 
cliquewidth at most 13 and size polynomial in n, where n is the size of the formula. All entries 
in the matrix are either 0, 1, constants of the formula, or variables of the formula. 

Proof. Let be a formula of size n. Due to [S] we know that ip can be expressed as the permanent 
of a matrix M that has treewidth at most 2 and size at most {n + 1) x {n + 1). Let G be the 
underlying graph of M and let T = {Vt,Et) be the 2-tree-decomposition of G. With only a 
linear increase in size of T we can assume that T is a binary tree-decomposition. 
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Based on the tree-decomposition T of G we construct a graph G' of bounded W-cHquewidth 
such that (with shght abuse of notation) per{G) = per{G'). A major difference between gram- 
mars for bounded treewidth matrices and grammars for bounded chquewidth matrices is that 
we cannot "merge" two vertices into a single vertex when deaUng with grammars for bounded 
chquewidth matrices. As a consequence the graphs G and G' will not be isomorphic, but there 
will be a 1 to 1 correspondence between their cycle covers. 

For every non-loop edge {u, v) of G there can be multiple nodes t G Vt such that u and v 
both are in the set Xt. We say that an edge {u, v) of G "belong" to a node f G Vr, if ^ is the node 
closest to the root of T where u and v both are in Xt (for every edge such a node is uniquely 
defined). 

The general idea for the construction of G' is as follows: We process T in a bottom-up 
manner. For a node t € Vt we first construct subgraphs representing the children I and r of t, 
then we add the edges belonging to t using a special labeling scheme for the vertices. We do 
not have a label in the grammar for each vertex of G because this will not result in a constant 
number of labels. Instead, since \Xi\ < 3 and \Xr\ < 3 we use labels to represent vertices in Xi 
and Xr and reuse these labels during the processing of T. 

A vertex v of G is represented through multiple vertices in G', but only two of them are 
"active" at any time during the construction of G': One vertex of indegree is managing edges 
leaving v in G, and one vertex of outdcgrcc is managing edges entering v in G. Since Xi and Xr 
both have size at most 3 we then need the following labels for this scheme: left-a-in, left-a-out, 
left-b-in, left-b-out, left-c-in and left-c-out (and 6 similar labels for right). In addition to that we 
also need a sink label, giving a total of 13 labels needed to construct G'. 

Processing T to construct G': For a leaf t of T we construct 6 vertices (or 4, if \Xt\ = 2), 
with the labels left-a-in, left-a-out, left-b-in, left-b-out, left-c-in and left-c-out (assuming t is the 
left child of its parent). For non-loop edges belonging to node t, e.g. a directed edge from the 
vertex represented with labels left-b-in/out to the vertex represented with labels left-a-in/out 
of weight w, we then add edges (actually just a single edge is added because both of the labels 
are only assigned to one vertex of G') from vertices with label left-b-out to vertices with label 
left-a-in of weight w. Next, if a vertex of G, e.g. the vertex represented by left-b-in/out, is not 
present in Xp {p being the parent of t inT), then we add an edge of weight 1 from left-b-in to 
left-b-out. Furthermore, if that vertex has a loop of weight w we add an edge of weight w from 
left-b-out to left-b-in. In both cases we then rename left-b-out and left-b-in to sink. 

For an internal node t & Vr (including the root of T) we first consider vertices of G that 
are in both Xi and Xr, e.g. left-a-in/out and right-b-in/out represent the same vertex of G. We 
assume that t is the left child of its parent in T. We add a loop of weight 1 to each of right- 
b-in and right-b-out. Then we add an edge of weight 1 from right-b-in to left-a-in and an edge 
of weight 1 from left-a-out to right-b-out. Then right-b-in and right-b-out are renamed to sink. 
Next we add two vertices to G' for every vertex in Xt that are not in Xi nor Xr. There will be 
"available" in/out labels for these two vertices, since in this case at least two other vertices were 
renamed to sink during processing of each child of t. Next we consider all edges of G belonging 
to t. Assume there is a directed edge from the vertex represented by right- c- in/ out to the vertex 
represented by left-b-in/out of weight w, then we add an edge of weight w from right-c-out to 
left-b-in. Last, if a vertex of G, e.g. the vertex represented by left-b-in/out, is not present in Xp 
[p being the parent of t in T) or if t is the root of T then we add an edge of weight 1 from left-b-in 
to left-b-out. Furthermore, if that vertex has a loop of weight w we add an edge of weight w 
from left-b-out to left-b-in. In both cases we then rename left-b-out and left-b-in to sink. 
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Proof of correctness: A vertex w of G is represented through two disjoint sets of vertices 
in G' : One set of vertices managing edges entering v in G, and one set of vertices managing 
edges leaving v in G. We denote these sets of vertices in G' as Vin and Vout- A vertex of G' 
belong to Vin if at some point during the processing of T it were assigned an in label which was 
representing v in G. By our construction it is clear that every vertex of G' belong to either Vin 
or Vout for exactly 1 vertex v of G, and the set Ui„ form a directed tree where all non-loop edges 
lead towards the root and have weight 1. All non-root vertices in this tree have a loop of weight 
1. The set Vout has equivalent properties, with the exception that non-loop edges lead towards 
the leaves instead of the root. 

Now consider two vertices u and u of G along with a directed edge of weight w from u to w, 
and consider the trees Uout and Vin in G'. At some point in the construction of G' an edge of 
weight w was added from a vertex in Uout to a vertex in in G', so there is a path of weight 
w from the root of Uout to the root of Vin and all vertices of Uout and Vin not in this path have 
a loop of weight 1. So in a cycle cover of G where we include the edge from u to w we then have 
an equivalent path in G' and all remaining vertices in Uout and are then covered by loops. 
In order to "continue" the construction of the path in G' we then also have an edge of weight 1 
from the root of to the root of Vout- In order to simulate loops in cycle covers of G' we have 
added an edge from the root of Vout back to the root of of same weight as the loop in G. So 
a loop in G corresponds to a cycle of length 2 in G', and then all other nodes in both Vm and 
Vout are covered by loops of weight 1. 

It is then easy to verify that cycle covers in G' are in bijection with cycle covers of G and 
the corresponding pairs of cycle covers have same weight. Finally, note that between any two 
vertices of G' there is at most 1 edge so we can find a matrix M' such that the underlying graph 
of M' is equivalent to G' and then per{M') — per{M). □ 

Theorem 15. Every arithmetic formula can be expressed as the hamiltonian of a matrix ofW- 
cliquewidth at most 34 and size polynomial in n, where n is the size of the formula. All entries 
in the matrix are either 0, 1, or constants of the formula, or variables of the formula. 

Proof. Let be a formula of size n. Due to [8] we know that can be expressed as the hamil- 
tonian of a matrix M that has treewidth at most 6 and size at most (2?! + 1) x (2?i + 1). Let 
G be the underlying, weighted, directed graph for the matrix M and let T — {Vt,Et) be the 
binary 6-tree-decomposition of G. With only a linear increase in size of T we can assume that 
T is a binary tree-decomposition. 

The overall idea is the same as in Theorem [Ml- namely to process the tree-decomposition T 
of G. Since all \Xt \ < 7 in this tree-decomposition we instead need at least 2 • 14 + 1 = 29 labels 
during the processing of T to construct G'. 

However, if we just use the exact same idea as in Theorem[SJ then for every cycle cover in the 
produced graph many vertices are covered through loops. Instead of introducing such loops we 
"eliminate" them using the same idea as in [13j used for showing universality of the hamiltonian 
polynomial. 

We need 5 additional labels for this construction: left-hl, left-h2, right-hl, right-h2 and temp, 
for a total of 34 labels. For a leaf t of T we start the processing of t by constructing two vertices 
and label them left-hl and left-h2 (assuming t is the left child of its parent in T), and add an 
edge of weight 1 from left-hl to left-h2. Remaining processing of t is done as before. 

For an internal node t of T we first add an edge of weight 1 from left-h2 to right-hl, rename 
left-h2 and right-hl to sink, and rename right-h2 to left-h2 (assuming t is the left child of its 
parent in T). Some vertices, e.g. the vertex with label right-c-in, may have a loop added during 
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the processing of t. Instead of adding such a loop we do the foUowing: Add a new vertex with 
label temp, add an edge of weight 1 from left-h2 to right-c-in, add an edge of weight 1 from 
right-c-in to temp, add an edge of weight 1 from left-h2 to temp, rename left-h2 to sink, rename 
temp to left-h2. Remaining processing of t is done as before. 

When we reach the root r of T we consider any vertex of Xr, e.g. the vertex represented 
by labels left- a-in/ out. In the final step, instead of adding an edge of weight 1 from left-a-in 
to left-a-out, we add an edge of weight 1 from left-a-in to left-hl and an edge of weight 1 from 
left-h2 to left-a-out. Now, for every hamiltonian cycle of G we break up the equivalent cycle of 
G' and visit any remaining vertices of G" along a path of total weight 1. □ 

Theorem 16. Every arithmetic formula can be expressed as the sum of weights of perfect match- 
ings of a symmetric matrix of W -cliquewidth at most 26 and .size polynomial in n, where n is 
the size of the formula. All entries in the matrix are either 0, 1, constants of the formula, or 
variables of the formula. 

Proof. It is a direct consequence of Theorem [14] and Lemma [2j □ 
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